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1 . Field of the Invention 

The present invention relates to computer system 
architecture. More specifically, the present invention relates to 
retrieving performance monitor data from an I/O processor. 

2 . Background Information 

Electronic products may be thought of as those products that 
involve the controlled conduction of electrons or other charge 
carriers, especially through microprocessors. Examples of 
electronic products include radios, computers, work stations, and 
servers as well as those involved in high-end networking and 
storage technology. Just about all electronic products employ one 
or more microprocessors disposed within a chip located on a 
printed circuit board. These microprocessors engage a computer 
operating system as well as applications. The main central 
processing unit within the chip includes a host system. It is 
this host system that runs the computer operating system and the 
applications . 

One type of processor within the host system is a host 
processor having a host memory. Another type of processor that 
may be within the host system is an inpur-output (I/O) processor. 
The I/O processor or I/O Platform (IOP) is a component of the host 
system that connects the host system memory to an I/O device to 
process I/O transactions. The I/O device may be a part of or 
external to the host system through at least one of a primary bus 
and a secondary bus. Examples of I/O devices include storage 
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devices such as a small computer systems interface (SCSI) 
controller for a disk and networking devices such as an Ethernet 
controller. 

One main function of a host system is to transmit data 
between the host memory and an I/O device via the I/O processor. 
Transmitted data includes application data, local area network 
(LAN) packets, and contents stored on a disk. To accomplish data 
transmission, data handling and processing units such as a core 
processor and a local memory are included within the I/O 
processor. The core processor and the local memory are coupled to 
each other through an internal bus and to a messaging unit and a 
direct memory access unit through that same internal bus. 
Ideally, the system performs within established parameters. 

To measure and monitor various system parameters that 
contribute to the overall performance of the I/O processor, a 
performance monitoring unit is integrated into the I/O processor. 
Under current standards, the tasks of the performance monitoring 
unit include compiling performance measurements on the three 
buses: the primary bus; the secondary bus; and the internal bus. 
The measurements of the performance monitoring unit can be used to 
refine code for improved system level performance. However, these 
measurements exist in raw, binary data for which no mechanism 
exists that gathers and compiles this raw, I/O performance monitor 
data into a form that readily is usable by a computer programmer 
or operator. 
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The present invention relates to retrieving performance 
monitor data from an I/O processor. A performance monitoring 
driver coupled to a performance monitoring unit is registered as a 
private driver with a real time operating system of the I/O 
processor. Events within the I/O processor are selected on which 
to gather data. The selected events are sent as a message request 
to the real time operating system. The message request is 
translated into the appropriate parameters based on a set of 
private group parameters that are accessible by the real time 
operating system. The message request is sent as a translated 
request to the performance monitoring unit. The pieces of data 
requested by the translated request are returned to the 
performance monitoring driver. The pieces of data then are sent 
to a location specified in the message request. 
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Figure 1 is a functional block diagram of an I/O processor; 

Figure 2 is a block diagram of a networking system 200; 

Figure 3 illustrates a more detailed view of a host 
processor and an I/O processor; and 

Figure 4 is a flow diagram of a method of operation 500 of 
the invention. 
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Figure 1 is a functional block diagram of I/O processor 10. 
An example of a known processor functional block diagram is 
illustrated and described for the Intel® i960® RM/RP 
Microprocessor as set out in Intel Corporation, i960® RM/RN I/O 
Processor Developer's Manual, pp. 1-1 through 1-12 (1st ed. July 
1998) . The description regarding Figure 1 is based on the Intel® 
i960® RM/RP Microprocessor. 

As show in Figure 1, I/O processor 10 integrates core 
processor 14 into a Peripheral Components Interconnect (PCI) 
functionality so as to address the needs of intelligent input- 
output applications ("intelligent I/O" or "l 2 0" applications) . 
Intelligent I/O applications may be coupled to primary PCI bus 16 
and/or secondary PCI bus 18. Preferably both PCI bus 16 and PCI 
bus 18 are industry standard, 64-bit/32-bit, high performance, low 
latency system buses coupled together by PCI-to-PCI bridge 20. A 
specification for the PCI bus is set forth in the document PCI 
Local Bus Specification, revision 2.1, October, 1994. This manual 
is prepared and maintained by the PCI Special Interest Group (PCI- 
SIG) . The PCI-SIG is an organization that is open for membership 
to all companies in the computer industry. 

Along with providing a connection path between the two 
independent PCI buses 16 and 18, bridge 20 provides the ability to 
overcome PCI electrical loading limits by allowing certain bus 
transactions on one PCI bus to be forwarded to the other PCI bus. 
Core processor 14 is indirectly connected to PCI-to-PCI bridge 20. 
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Bus interface unit 24 couples core processor 14 to internal bus 
26. In turn, internal bus 26 is coupled to primary transfer group 
28 and secondary transfer group 30. Internal bus 26 may be a 64- 
bit bus. PCI-to-PCI bridge 20 is coupled to primary transfer 
group 28 through primary PCI bus 16 and is coupled to secondary 
transfer group 30 through secondary PCI group 18, each path of 
which provides a link to core processor 14. By communicatively 
connecting core processor 14 to bridge 20, core processor 14 gives 
intelligence to bridge 20 to better address the needs of 
intelligent I/O applications coupled to primary PCI bus 16 and/or 
secondary PCI bus 18. 
jf PCI agents 100 may be coupled to either primary PCI bus 16 or 

y secondary PCI bus 18 so as to interact with one of the transfer 
€1 groups, 28 and 30. PCI agents 100 may include external PCI 
63-5 devices or a host processor, such as host processor 240. Within 

O PCI agent 100 may be PCI memory having PCI address spaces. 

k% - 

W Internal bus 26 may be coupled to local memory 38 through 

J35RS1 

dy memory controller 40. Local memory 38 includes memory systems 

external to I/O processor 10 that do not require external logic. 
20 Examples of local memory 38 include Synchronous Dynamic Random 
Access Memory (SDRAM) , Read-Only Memory (ROM) , and Flash memory. 

Primary transfer group 28 preferably is composed of Address 
Translation Unit 32, two Direct Memory Access channels 34, and 
messaging unit 36. Secondary transfer group 30 preferably is 
25 composed of Address Translation Unit 42 and One DMA channel 44. 

Address Translation Unit (ATU) 32 allows transactions between 
the PCI address space within PCI agent (or "agents") 100 and 
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address space 46 within local memory 38. Address translation may 
be controlled through programmable registers (not shown) that are 
accessible from both PCI agent 100 and core processor 14. ATU 42 
functions similarly to ATU 32, but performs on secondary PCI bus 
18 for PCI agents 100 coupled to secondary PCI bus 18. Dual 
access to registers through ATU 32 and ATU 42 allows flexibility 
in mapping the coupled address spaces. 

To insure low latency and high throughput data transfers 
between PCI agents 100 and local memory 38, three separate DMA 
channels are provided as shown in Figure 1. Two Direct Memory 
Access (DMA) channels 34 are included with primary transfer group 
28 and one DMA channel 44 is included with secondary transfer 
group 30. The three DMA channels operate as a DMA controller to 
support chaining and unaligned data transfers. This DMA 
controller is only programmable through core processor 14. 

The I 2 0 Architecture Specification describes an open 
architecture for developing device drivers in a system 
environment. Conventionally, based on the I 2 0 Architecture 
Specification, messaging unit (MU) 36 provides data transfer 
between PCI agents 100 coupled to primary PCI bus 16 and core 
processor 14. MU 36 can be used to send and receive messages. 
Moreover, MU 36 interrupts PCI bus agents 100 or core processor 14 
when new data arrives and passes the data as directed. 

Core processor 14 is interfaced with internal bus 26 through 
bus interface unit 24. Local memory 38 is coupled to internal bus 
26 through memory controller unit 40. Microcontrollers 56 are 
interfaced with internal bus 26 through the series of Inter- 

042390. P6518 - 7 - VPT/ab 

Express Mail No: EM522828818US 



Integrated Circuit (I 2 C) serial bus 50 and I 2 C bus interface unit 
52. Both local memory 38 and microcontrollers 56 are external to 
I/O processor 10. Application accelerator unit 54 is also coupled 
to internal bus 26. 

Memory controller 40 allows direct control of local memory 
38. Core processor 14 operates out of local memory 38 where this 
memory space is independent of PCI agents 100. Bus interface unit 
(BIU) 24 forwards accesses to core processor 14 to internal bus 26 
without translation. Microcontrollers 56 perform management 
functions for the systems of I/O processor 10. Application 
accelerator unit (AAU) 54 executes data transfers to and from 
local memory 38 on behalf of core processor 14 as set out for the 



I 2 0 standard. 



I/O processor 10 also includes internal arbitration unit 60 
to serve as arbiter for the systems of internal bus 26, secondary 
PCI arbitration unit 62 to serve as arbiter for the secondary PCI 
bus 18, and performance monitoring unit (PMON) 64. 

As noted above, performance monitoring unit (PMON) 64 may be 
used to compile performance measurements on the three buses: 
primary PCI bus 16, secondary PCI bus 18, and internal bus 26. 
The compiled measurements of PMON 64 can be used for performance 
analysis or real-time system tuning by refining code for improved 
system level performance. 

Within the l 2 0 Architecture, data is stored in parameter 
groups on I/O processor 10 such as in local memory 38. Data in 
local memory 38 may be modified or extracted by host processor 240 
through a message sent by host processor 240 to I/O processor 10. 

042390. P6518 - 8 - VPT/ab 

Express Mail No: EM522828818US 



However, the I 2 0 Architecture does not provide any mechanism for 
gathering performance measurements compiled by PMON 64. The below 
embodiments of the invention extend the I 2 0 Architecture to provide 
this capability. 

Figure 2 is a block diagram of networking system 2 00. 
Networking 200 includes client 300 coupled to client 400 through 
host system 230. Host system 230 may include host processor 240 
coupled to I/O processor 210 through host system or PCI bus 250. 
Within host system 230 is I/O device 260 interfaced with I/O 
processor 210. Network lines 402 are coupled to I/O device 260. 

Client 300 may be a computer that includes data input devices 
such as keyboard 302 and mouse 304 and includes visual monitor 
306. Preferably, host system 230 physically is part of client 
300, but may be remote from client 300. For example, client 300 
may be in one location and host system 230 may be in another 
location, but connected via communication channels 308 such as 
radio signals, cabling, or the Internet. 

As one example of networking system 200, host system 230 may 
be connected to client 400 through network lines 402. Network 
lines 402 may be any form of communication channel over which data 
from host system 230 may be transmitted to client 400. Client 400 
may be composed of one computer or millions of computers. 

Figure 3 illustrates a more detailed view of host processor 
240 and I/O processor 210. Within host processor 240 is an 
Operating System Specific Module (OSM) 300. Preferably, OSM 300 
is the host process software for the I 2 0 architecture. OSM 3 00 is 
coupled to Real Time Operating System (RTOS) 302. The software of 
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RTOS 302 abstracts a large portion of I/O processor 210 hardware 
from the rest of the software that runs on I/O processor 210. By 
permitting common commands to enable applications within I/O 
processor 210, RTOS 302 permits a programmer to develop a driver 
for one I/O device and get the driver running on one or many 
different I/O processors with minimal effort. 

Conventionally, drivers are used to couple devices external 
to I/O processor 210. As shown in Figure 3, Small Computer 
System interface (SCSI) device driver 312 couples SCSI controller 
310 to RTOS 302 as an example of a storage device. Networking 
402, as coupled to Ethernet 410, is coupled to RTOS 302 through 
networking driver 314 as an example of a networking device. 

RTOS 302 aids in message passing between the external devices 
and OSM 300. By conceptually treating PMON unit 64 as a device 
external to I/O processor 210, the invention takes advantage of 
message passing features of RTOS 302 by coupling PMON unit 64 to 
RTOS 302 through PMON driver 320. PMON driver 320 is a Device 
Driver Module (DDM) dedicated to performance (perf ) monitoring 
resources and may be referred to as "perf DDM" . PMON driver 320 
may operate by itself or work in conjunction with either a storage 
device or a networking device. 

The system management interface of the I 2 0 Architecture 
Specification provides for developing private parameter groups. 
Private parameter groups reside in the I/O processor memory. An 
Operating System Specific Module may be used to gather data from a 
driver. For example, private parameter groups of the invention 
preferably reside in the I/O processor memory and OSM 300 may be 
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used to gather data from PMON driver 320 of Figure 3. 

Each parameter group may include a group number, a group 
type, a group name, a description of the group, and includes one 
or more parameters belonging to the group. A parameter may be 
identified by a field number, whether the parameter is readable or 
writable, the file size of the parameter, the parameter name, and 
a description of the parameter. In one embodiment of the 
invention, three private parameter groups contain a total of 
thirty one fields reside as software in memory 38 of I/O processor 
210. 

Within performance monitoring Table 1, Table 2, and Table 3 
below are a set of parameter groups that define an embodiment of 
the invention. Table 1 is the Control Group, Table 2 is the Mode 
Control Group and Table 3 is the Mode Data Group. The contents of 
the parameter groups include the performance monitor data and 
performance monitor setup information. Software running on host 
processor 240 can gather or modify one or more parameters stored 
in these parameter groups. 
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Table 1 



Performance Monitoring Control Group 0x8 0 0 Ox 



Group Number 
Group Type 
Name 

Description 



0x8000x " " " 

SCALAR 

PERFM0N_C0NTR0L 

A table of all control parameters for hardware -based 
performance monitoring resources 



Field 
ldx 



(r/w) 



Field 
Size 



Parameter 
Name 



Description 



0 



r/w 



5 Bytes 



LockControl 



Bytes 0-3 are the 
AuthenticationKey . 

When the LockControl field is 
read by an application and the 
Locked bit is not set, the 
initiator Target Identification 
(TID) from the UtilGetParams 
message is saved, a unique 
AuthenticationKey is returned in 
these bytes of the field, and the 
Locked bit is tentatively changed 
to the set state. The application 
then has 2 seconds to lock the 
performance monitoring resources by 
sending a UtilParamsSet message 
with the proper TID to write to the 
LockControl field with the Locked 
bit set and the proper 
AuthenticationKey in Bytes 0-3. 
When this occurs, the resources 
remain locked, if they are 
currently unlocked. Otherwise, 
after the 2 second timeout period 
runs out, the locked bit is reset 
and other applications can attempt 
to lock the resources. During the 
2 second backoff period, other 
applications that read the 
LockedControl field will detect 
that the resources are already 
locked, and the AuthenticationKey 
will be set to something other than 
the proper one. 



r/w 



11 

Bytes 



Control 



Bit 0 of byte 4 is the Locked bit. 

For the Locked bit, 1 means 
locked and 0 means unlocked. When 
this bit is set along with a proper 
AuthenticationKey, the perfDDM 
saves the initiator TID, fields 
from the UtilParamsSet message. 
Every subsequent UtilParamsSet or 
UtilParamsGet message with an 
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Table 1 cont. Performance Monitoring Control Group 0x8 00 Ox 



Group I* 
Group 1 
Name 
Descrip 


lumber 
Vpe 

>tion 


0x8 00 Ox 
SCALAR 

PERFM0N_C0NTR0L 

A table of all control parameters for hardware -based 
performance monitoring resources 


Fieldl 
dx 


(r/w) 


Field 
Size 


Parameter 
Name 


Description 












1 

cont. 


r/w 


11 

Bytes 


Control 


initiator TID field equal to the 
saved TID value causes a watchdog 
timer to be zeroed. After 5 min., 
if no UtilParamsGet or 
UtilParamsSet messages to the 
perfDDM parameter groups with the 
initiator TID field equal to the 
saved value are received, then the 
locked state bit is cleared. This 
mechanism mediates resource 
contention between applications and 
DDMs under development, while 
preventing resource lockout due to 
halted applications. Default is 
unlocked(O) . 

NOTE: Because of the 2 second 
timeout condition, spurious 
UtilGetParams messages that read 
the ^ LockControl field should be 
avoided, since these reads would 
prevent other applications from 
being able to lock the performance 
monitoring resources during the 
backoff period. 

Bytes 0-3 contain the 
AuthenticationKey . 

The AuthenticationKey verifys that 
the configuration host application 
that is attempting to alter the is 
the one that locked the resources. 
On writes, this key must be equal 
to the last authentication key 
issued by the perfDDM. If equal, 
the host application's 
UtilParamsSet message will be 
processed normally. If not equal 
to the last issued authentication 
key, then the UtilSetParams reply 
message indicates an error. On 
reads this field returns 0. 
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Table 1 cont. Performance Monitoring Control Group 0x8000x 



Group Number 
Group Type 
Name 

Description 


0x8 00 Ox 
SCALAR 

PERFMON_CONTROL 

A table of all control parameters for hardware -based 
performance monitoring resources 


Fieldl 
dx 


(r/w) 


Field 
Size 


Parameter 
Name 


Description 












1 

cont. 


r/w 


11 
Bytes 


Control 


Bit 0 of byte 4 is the Accumulate 
count bit. 

For the Accumulate count bit, 
0 means only report deltas for last 
interval and 1 means accumulate 
counters and time intervals . 
Default is Accumulate counters and 
time intervals (1) . 

Bit 1 of byte 4 is the Counting_pn 
bit. 

The Counting_On bit gives the 
application a quick way to turn off 
counting in the performance 
monitor. This will be used to 
limit the impact of performance 
monitoring-related bus accesses on 
the counter statistics reported. 
When an application wants to turn 
counting off, it will send the 
smallest UtilParamsSet message 
possible to access only this field. 
When counting is turned ON, the 
saved counter values for each mode 
are zeroed out. Default is 
counting off. 

Note: when Counting_On is 1, all 
writes to any other parameter group 
fields or bits than Counting__ON are 
disallowed. An error will be 
returned for such accesses. 

Note: when Count ing_On is 0 and 
all the Modellnterval fields for 
all modes are set to 0 (see group 
0x8001, field #1 below) , counting 
cannot be turned on, since no modes 
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Table 1 cont. Performance Monitoring Control Group 0x8000x 



Group Number 
Group Type 
Name 

Description 



0x8000x " ' 

SCALAR 

PERFMON_CONTROL 

A table of all control parameters for hardware -based 
performance monitoring resources 



Fieldl 
dx 



(r/w) 



Field 
Size 



Parameter 
Name 



Description 



1 

cont . 



r/w 



11 
Bytes 



Control 



have been allocated counting 
intervals. Attempts to set 
Counting_On in this case will 
result in an 

12 0_PARAMS_STATUS_SCALAR_ERROR 
being returned by the operation. 

Bit 2 of byte 4 is the 
GlobalSendOnCycleEnd bit. 

If bit 2 is set, the EventData 
fields of Event Acknowledge 
Messages for the CYCLE_END event 
will contain selected rows of the 
PERFMON_MODE_DATA table group. If 
this bit is reset, Event 
Acknowledge messages contain no 
EventData. Default is reset. 

Bit 3 of byte 4 is the Pause bit. 

The Pause bit can be used to 
temporarily disable counting and 
subsequently reenable counting 
without zeroing the saved counter 
values for each mode. Default is 
OFF. 



Byte 5 is the CurrAlgorithm 

The CurrAlgorithm is used to 
set the sampling algorithm. 
Initially, there will be two 
sampling algorithms supported: 
User-configurable simple round- 
robin (0) , and distributed round- 
robin(l). Default: distributed 
round- robin. 

Bit 2 of byte 4 is the 
GlobalSendOnCycleEnd bit. 
If bit 2 is set, the EventData 
fields of Event Acknowledge 
Messages for the CYCLE_END event 
will contain selected rows of the 
PERFMON MODE_DATA table group. If 
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Table 1 cont. Performance Monitoring Control Group 0x8000x 



Group 
Number 
Group Type 
Name 

Description 


0x8 00 Ox 
SCALAR 

PERFMON__CONTROL 

A table of all control parameters for hardware -based 
performance monitoring resources 


Fieldl 
dx 


(r/ 
w) 


Field 
Size 


Parameter 
Name 


Description 


1 

cont . 


r/w 


11 Bytes 


Control 


this bit is reset, Event 
Acknowledge messages contain no 
EventData. Default is reset. 

Bit 3 of byte 4 is the Pause bit* 
The Pause bit can be used to 
temporarily disable counting and 
subsequently reenable counting 
without zeroing the saved counter 
values for each mode. Default is 
OFF. 

Byte 5 is the Curr Algorithm 

The CurrAlgorithm is used to 
set the sampling algorithm. 
Initially, there will be two 
sampling algorithms supported: 
User-configurable simple round - 
robin (0), and distributed round- 
robin (1) . Default: distributed 
round- robin. 

Byte 6 is the SampleUnits. 

The SampleUnits specifies the 
units for the selected sample 
interval : usee ( 0 ) , msec ( 1 ) , 
seci^i; , itiim j) . uerauj-U: msec. 

Bytes 7 through 10 are the 
Samplelnterval; which specifies the 
number of SampleUnits in the 
selected sample interval. Default: 
100, giving a default sample 
interval of 100 msec. If the users 
select as a sample interval less 
than the minimum sample interval, 
the actual sample interval is 
rounded up. The largest sample 
interval supported is 71 minutes. 
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Table 1 cont. Performance Monitoring Control Group 0x8000x 



Group JN 
Group 1 
Name 
Descrip 


lumber 
ype 

>tion 


0x8 00 Ox 
SCALAR 

PERFMON_CONTROL 

A table of all control parameters for hardware -based 
performance monitoring resources 


Fieldl 
dx 


(r/w) 


Field 
Size 


Parameter 
Name 


Description 


o 


r 


1 Byte 


MaxMode 


Maximum # of modes for the 
performance monitoring resources, 
DDMs use this to indicate the 
number of modes supported by the 
underlying hardware. Typically 
these are hardware modes. 


3 


r 


1 Byte 


CurrentMode 


Current mode of the performance 
monitoring resources. This is used 
for debugging purposes. 


4 


r 


1 Byte 


MaxAlgorith 
m 


Maximum # of sample algorithms. 
DDMs can use this to indicate the 
number of algorithms that can be 
supported. Sample algorithms are 
software-controlled. 


5 


r 


1 Byte 


MinSampleUn 
its 


Specifies the units for the minimum 
supported sample interval: 
usec(O), msec(l), sec(2), min(3) . 
This value will be determined by 
the resolution of the RTOS event 
timer. 


6 


r 


4 

Bytes 


MinSampleln 
terval 


Number of MinSampleUnits in the 
minimum supported sample interval . 
This value will be determined by 
the resolution of the RTOS system 
timer rounded to the nearest 
microsecond. 


7 


r 


1 Byte 


PerfHWType 


Type of performance monitoring 
hardware available: -0 means NONE, 
1 means i960 RN or RM, all other 
values are reserved. 


11 


r 


1 Byte 


NumCounters 


Number of performance monitor 
counters supported by hardware. 


13 


r 


1 Byte 


Samplelnter 

val 

Status 


Provides an indicator of whether 
the user-selected sample interval 
is okay(0) / too small(l), or too 
large (2) , adiusted(3) . 


14 


r 


4 

Bytes 


AdjustedSam 
pie 

Interval 


Displays the rounded up sample 
interval, e.g. the user-configured 
sample interval rounded up to the 
next integer multiple of the system 
timer resolution. 
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Notes for Table 1: The group numbers for the performance 
monitoring private parameter groups preferably will be determined 
by the I 2 0 SIG assignee of the inventor. Moreover, the 
AuthenticationKey may be based on a global timer to guarantee 
uniqueness. In addition, clearing the locked stage bit after five 
minutes preferably will not prevent resource contention between 
multiple applications on the same host since messages from these 
all have the same initiator TID value. Originally, it was thought 
that the initiator context field could be used for this purpose. 
However, some host operating systems control this field. 
Applications have no control over this field. 
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Table 2 Performance Monitoring Mode Control Group 0x8001 



Group 
Number 
Group Type 
Name 

Description 


0x8001 
Table 

PERFM0N_M0DE_C0NTR0L 

A table of mode-specific control parameters for the 
performance monitoring resources. 


Fieldl 
dx 


(r/ 
w) 


Field 
Size 


Parameter 
Name 


Description 












0 


r 


1 Byte 


Mode 


Performance monitoring mode to be 
controlled, 1-7 . 


1 


r/w 


8 Bytes 


ModeControl 


Bytes 0-3 contain the 
AuthenticationKey 

The AuthenticationKey is used 
to verify that the host application 
which is attempting to alter the 
configuration is the one that 
locked the resources. On writes, 
this key must be equal to the last 
authentication key issued by the 
perfDDM. If equal, the host 
application's UtilParamsSet message 
will be processed normally. If not 
equal to the last issued 
authentication key, then the 
UtilSetParams reply message 
indicates an error. On reads, 
bytes 3-7 return 0. 

Bytes 4 - 7 of this field contain 
the Modelntervals . 

The Modelntervals represents 
the number of selected sample 

-LllUtri- VCtXo v_lt:kJ._LC.Cl L-CtJ. tU Ct \JCiX. Lltlilul 

mode when using the simple round- 
robin; writes to this sub-field 
cause all modes counter values to 
be zeroed, if not already done. A 
flag, counts^ zeroed, in perfDDM 
will track this; it will be set 
when the first Mode Time is zeroed, 
and reset when counting is turned 
on. This will reduce the impact on 
the overall system that would occur 
if the Mode Times for more than one 
mode were altered. This defaults 
to 10 times the sampling interval. 



Note for Table 2: No rowDelete or rowAdd operations need be 
supported by perfDDM since these fields are present for all modes. 
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Table 3 Performance Monitoring Mode Data Group 0x8002 



Group Number 
Group Type 
Name 

Description 


0x8002 
Table 

PERFMON_MODE_DATA 

A table of data parameters for all counters in each 
mode. Each row pertains to an individual mode. 


Fieldl 
dx 


(r/w) 


Field 
Size 


Parameter 
Name 


Description 












0 


r 


1 Byte 


Mode 


Performance monitoring mode for the 
data, 1-7. 


1 


r 


8 

Bytes 


CurrTime 


Current accumulated value of GTSR 
register plus rollover bits. 

This value accumulates over 
multiple sample intervals until the 
end of the timeslice is reached. 
This is updated whenever an 
interval ends, it cne nose 
application disables counting in 
the middle of an interval, or if 
the host application requests the 
data in the middle of an interval. 
Main use of this is to determine 

J. J. WW OL'CllC Ui. \^ Li. J- -L Cill L. L-IJ.ti Vj-O. L-CL _L o 

when very long interval times are 
beincr u^ed TTnit^ ayp t~Vit=* tj^ti r>ri 
of the GTSR clock. 


2 


r 


8 

Bytes 


SigmaTime 


Accumulated time for thic; mndp at 
end of last completed interval. 
Units are the period of the GTSR 
clock. 


3 


r 


8 

Bytes 


EndingTime 


Value of GTSR plus rollover bits at 
the end of the last completed 
interval . 

This is compared to CurrTime 
to determine the relative currency 
of the counter data. 


4 


r 


8 

Bytes 


Counter 01 


PEC01 counter value at end of last 
completed interval. 


5 


r 


8 

Bytes 


Counter02 


PEC 02 counter value at end of last 
completed interval. 


6 


r 


8 

Bytes 


Counter 03 


PEC 03 counter value at end of last 
completed interval. 


7 


r 


8 

Bytes 


Counter 04 


PEC 04 counter value at end of last 
completed interval. 


8 


r 


8 

Bytes 


Counter 05 


PECOS counter value at end of last 
completed interval. 


9 


r 


8 

Bytes 


Counter 06 


PEC 06 counter value at end of last 
completed interval. 


10 


r 


8 

Bytes 


Counter07 


PECO 7 counter value at end of last 
completed interval. 
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Table 3 cont. Performance Monitoring Mode Data Group 0x8002 



Group Number 
Group Type 
Name 

Description 


0x8002 
Table 

PERFMON_MODE_DATA 

A table of data parameters for all counters in each 
mode* Each row pertains to an individual mode. 


Fieldl 
dx 


(r/w) 


Field 
Size 


Parameter 
Name 


Description 












11 


r 


8 

Bytes 


Counter 08 


PECO 8 counter value at end of last 
completed interval. 


12 


r 


8 

Bytes 


Counter 09 


PEC 09 counter value at end of last 
completed interval. 


13 


r 


8 

Bytes 


Counter 10 


PEC10 counter value at end of last 
completed interval. 


14 


r 


8 

Bytes " 


Counterll 


PECll counter value at end of last 
completed interval. 


15 


r 


8 

Bytes 


Counterl2 


PEC12 counter value at end of last 
completed interval. 


16 


r 


8 

Bytes 


Counterl3 


PEC13 counter value at end of last 
completed interval. 


17 


r 


8 

Bytes 


Counterl4 


PEC 14 counter value at end of last 
completed interval. 



Notes for Table 3 : SigmaTime and all counter values are 



initialized to zero. Also, if delta counting is selected, when 
the performance monitor mode is first entered, Sigma time and all 
the counter values are zeroed. In cumulative counting, the Sigma 
time and all of the counter values are not zeroed when the mode is 
first ^entered, and when the mode is exited the values in the GTSR 
and all of the counters are added to the values in this parameter 
group , 
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Figure 4 is a flow diagram of a method of operation 500 of 
the invention. Method 500 may be implemented in a computer 
readable storage medium containing executable computer program 
instructions which when executed cause the networking system to 
perform method 500. Also, method 500 may be implemented in a 
distributed readable storage medium containing executable computer 
program instructions which when executed cause an I/O processor to 
perform method 500. 

At initialization 502 of method 500, the software in local 
memory 38 of I/O processor 210 initializes. This creates a 
performance monitoring (PMON) storage table in local memory 38. 
The PMON storage table can store and keep separate the various 
pieces of information retrieved from PMON unit 64 and placed in 
that table. Additionally, initializing the software in local 
memory 38 also causes PMON driver 320 to register as a private 
driver with RTOS 302 of I/O processor 210 since PMON driver 320 is 
not a networking type driver or a storage type driver. 

In operation, a user of client 300 or client 400 of Figure 2 
initiates a performance monitor application at step 506 that 
generates a selection screen at visual monitor 306. The selection 
screen allows the user to select those events on primary PCI bus 
16, secondary PCI bus 18, or internal bus 26 for which the user 
desires to compile data. The user may also specify the time 
periods that the user desires to see the compiled data at visual 
monitor 306. In regards to the events on primary PCI bus 16, 
secondary PCI bus 18, or internal bus 26, the I 2 0 Architecture 
divides monitorable events into occurrence events and duration 
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events. Occurrence events is counted each time the event occurs. 
For a duration event, a counter counts the number of clocks during 
which a particular condition or set of conditions is true. A 
total of ninety-eight events may be monitored, subject to the 
availability of event counters. 

At step 510, the user selects at visual monitor 306 those 
events on primary PCI bus 16, secondary PCI bus 18, or internal 
bus 26 for which the user desires to compile data by entering 
their selection in the selection screen. After selecting those 
events the user desired monitored, the information entered into 
the selection screen at visual monitor 306 is sent at step 514 to 
OSM 300 of host processor 240 as a message request that specifies 
specific pieces of data from PMON unit 64. OSM 300 relays this 
message request to RTOS 302 at step 518 without an understanding 
of whether the message request is for a networking type driver, a 
storage type driver, or a private driver. However, RTOS 302 does 
recognize this message request as a request for the PMON driver 
320, previously registered as a private driver with RTOS 302. 

Using the fields of the three private parameter groups 
residing in local memory 38 of I/O processor 210, RTOS 302 
translates the message request at step 522 into the appropriate 
parameter of the private group parameters of the invention. 
Parameters of the private group parameters are set out in Table 1, 
Table 2, and Table 3 above. 

At step 526, RTOS 302 sends this translated request to PMON 
driver 320. In response to receiving the translated request, PMON 
driver 320 queries PMON unit 64 at step 530 for the specific 
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pieces of data requested by the user. The query to PMON unit 64 
results in the requested pieces of data being sent to PMON driver 
320 at step 534. PMON driver 320, in turn, sends this data to the 
PMON storage table in local memory 38 at step 538 where the data 
will be compiled and stored for dispatch to the user at the time 
periods specified by the user at visual monitor 3 06 in the 
selection screen. The data preferably is saved in such a way that 
another application could be written that would take this data and 
present it to client 300, for example, in a meaningful fashion. 

At the time periods specified by the user in the selection 
screen, I/O processor 210 sends the compiled performance 
monitoring data back to the user through RTOS 302 and OSM 300 at 
step 542 . The compiled performance monitoring data may also be 
directed to other locations within networking system 200 for 
purposes such as to cause an effect. For example, the data may be 
sent to an interpreting device that determines whether the server 
of host system 23 0 is too hot based on the compiled performance 
monitoring data. If so, the interpreting device may generate a 
message that causes an internal fan to turn on. 

The exemplary embodiments described herein are provided 
merely to illustrate the principles of the invention and should 
not be construed as limiting the scope of the subject matter of 
the terms of the claimed invention. The principles of the 
invention may be applied toward a wide range of systems to achieve 
the advantages described herein and to achieve other advantages or 
to satisfy other objectives, as well. 
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CLAIMS 

What is claimed is: 



1 1. A method for retrieving performance monitor data from a 

2 processor, comprising: 

3 registering a performance monitoring driver as a private 

4 driver with a real time operating system (RTOS) of the processor, 

5 wherein the performance monitoring driver is coupled to a 

6 performance monitoring unit (PMU) ; 

7 selecting events within the processor to gather data on; 

8 sending the selected events as a message request to the RTOS; 

9 translating the message request into parameters based on a 
410 set of private group parameters that are accessible by the RTOS; 
QL1 sending the message request as a translated request to the PMU; 
412 returning the pieces of data requested by the translated 

gl3 request to the performance monitoring driver; and 

f-3.4 sending the pieces of data to a location specified in the 



{%5 message request. 



^1 2. The method of claim 1, further comprising: 

2 prior to registering the performance monitoring driver, 

3 initializing software in memory of the processor. 

1 3. The method of claim 1, further comprising: 

2 subsequent to registering the performance monitoring driver, 

3 initiating a performance monitor application that generates a 

4 selection screen at a visual monitor coupled to the processor 

5 through a host processor, wherein selecting events within the 

6 processor on which to gather data includes selecting the events at 
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7 the selection screen. 

1 4. The method of claim 3, wherein selecting events within 

2 the processor on which to gather data includes selecting one of 

3 ninety eight events arranged in connection with the selection 

4 screen* 

1 5. The method of claim 1, wherein sending the selected 

2 events as a message request to the real time operating system 

3 includes sending the message request through an operating system 

4 specific module of a host processor. 



0 1 6. The method of claim 1, wherein sending the translated 

41 2 request to the performance monitoring unit includes sending the 

gn 3 translated request through the performance monitoring driver. 

fn 1 7. The method of claim 1, wherein the set of private group 

^ 2 parameters includes at least one of (i) control parameters for 
hardware -based performance monitoring resources, (ii) mode- 

4 specific control parameters for a performance monitoring resource, 

m . 

* 5 and (iii) data parameters for at least one mode in one counter. 

1 8. The method of claim 7, wherein the set of private group 

2 parameters includes at least one of the following parameters: 

3 AdjustedSample, Control, CounterOl, Counter 02, Counter03, 

4 Counter04, CounterOS, Counter06, Counter07, Counter08, Counter09, 

5 CounterlO, Counterll, Counterl2, Counterl3, Counterl4, 

6 CurrentMode, CurrTime, EndingTime, Interval, LockControl, 

7 MaxAlgorithm, MaxMode, MinSamplelnterval , MinSampleUnits, Mode, 
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8 ModeControl, NumCounters, PerfHWType, Samplelnterval, SigmaTime, 

9 and Status. 



1 9. The method of claim 1, further comprising: 

2 subsequent to registering the performance monitoring driver, 

3 generating performance monitoring storage tables within memory of 

4 the processor. 

1 10. The method of claim 9, further comprising: 

2 subsequent to returning the pieces of data requested by the 

3 translated request to the performance monitoring driver, sending 

4 the pieces of data to the performance monitoring storage tables. 

J: 1 11. The method of claim 1, wherein sending the pieces of 

fri 2 data to a location specified in the message request further 

"are* 

y s 3 includes sending the pieces of data at a time period specified in 

^ 4 the message request. 

^! 1 12. The method of claim 11, wherein the location specified 

4! 2 in the message is a client coupled to the processor through a 

fi3 network. 

1 13. The method of claim 11, wherein the location specified 

2 in the message is a client coupled to the processor through a host 

3 processor. 

1 14. The method of claim 11, wherein the location specified 

2 in the message is a means for determining whether the server of a 

3 host system is too hot based on the pieces of data returned from 

4 the performance monitoring unit. 
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1 15. The method of claim 14, wherein the means for 

2 determining is an interpreting device, the method further 

3 comprising: 

4 generating a message in the interpreting device that causes a 

5 fan internal to the host system to turn on in response to the 

6 pieces of data returned from the performance monitoring unit. 

1 16. In a networking system including a host system having a 



2 host processor coupled to an processor through a peripheral 

3 components interconnect bus, and including a first client coupled 

4 to the host system and a second client coupled to the host system 

5 through network lines, a computer readable storage medium 

Q 6 containing executable computer program instructions which when 

yg 7 executed cause an processor to perform a method comprising: 
ff% 8 registering a performance monitoring driver as a private 

^ 9 driver with a real time operating system of the processor, wherein 

fto the performance monitoring driver is coupled to a performance 

w|l monitoring unit; 

il2 selecting events within the processor on which to gather 

13 data; 

14 sending the selected events as a message request to the real 

15 time operating system; 

16 translating the message request into parameters based on a 

17 set of private group parameters that are accessible by the real 

18 time operating system; 

19 sending the message request as a translated request to the 

20 performance monitoring unit; 
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21 returning the pieces of data requested by the translated 

22 request to the performance monitoring driver; and 

23 sending the pieces of data to a location specified in the 

24 message request. 

1 17. The computer readable storage medium of claim 16, 

2 wherein the set of private group parameters includes at least one 

3 of (i) control parameters for hardware -based performance 

4 monitoring resources/ (ii) mode-specific control parameters for a 

5 performance monitoring resource, and (iii) data parameters for at 

6 least one mode in one counter. 

Hi 18. The computer readable storage medium of claim 17, the 

Q2 method further comprising: 

il 3 subsequent to registering the performance monitoring driver, 

g|4 generating performance monitoring storage tables within memory of 

the processor; and 
^6 subsequent to returning the pieces of data requested by the 

7 translated request to the performance monitoring driver, sending 
f ^8 the pieces of data to the performance monitoring storage tables. 

1 19. A distributed readable storage medium containing 

2 executable computer program instructions which when executed cause 

3 an processor to perform a method for retrieving performance 

4 monitor data from the processor, the method comprising: 

5 registering a performance monitoring driver as a private 

6 driver with a real time operating system of the processor, wherein 

7 the performance monitoring driver is coupled to a performance 

8 monitoring unit; 
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9 selecting events within the processor on which to gather 

10 data; 

11 sending the selected events as a message request to the real 

12 time operating system; 

13 translating the message request into parameters based on a 

14 set of private group parameters that are accessible by the real 

15 time operating system; 

16 sending the message request as a translated request to the 

17 performance monitoring unit; 

18 returning the pieces of data requested by the translated 

19 request to the performance monitoring driver; and 

tfSO sending the pieces of data to a location specified in the 

Q21 message request. 

ft* 1 20. The distributed readable storage medium of claim 19, 

™ 2 wherein the set of private group parameters includes at least one 

[ps 3 of (i) control parameters for hardware -based performance 

S4 monitoring resources, (ii) mode-specific control parameters for a 

■|f5 performance monitoring resource, and (iii) data parameters for at 

6 least one mode in one counter. 

1 21. The distributed readable storage medium of claim 20, the 

2 method further comprising: 

3 subsequent to registering the performance monitoring driver, 

4 generating performance monitoring storage tables within the memory 

5 of the processor; and 

6 subsequent to returning the pieces of data requested by the 

7 translated request to the performance monitoring driver, sending 
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the pieces of data to the performance monitoring storage tables. 
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ABSTRACT 

The present invention relates to retrieving performance 
monitor data from an I/O processor, A perf ormance monitoring 
driver coupled to a performance monitoring unit is registered as a 
private driver with a real time operating system of the I/O 
processor. Events within the I/O processor are selected on which 
to gather data. The selected events are sent as a message request 
to the real time operating system. The message request is 
translated into the appropriate parameters based on a set of 
private group parameters that are accessible by the real time 
operating system. The message request is sent as a translated 
request to the performance monitoring unit. The pieces of data 
requested by the translated request are returned to the 
performance monitoring driver. The pieces of data then are sent 
to a location specified in the message request. 
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P44,976; Daniel E Ovaoezian, Reg. No. 41 ,236; Babak Redjaian, Reg. No. 42,096; Wilfiam F. Ryamt, Reg. 
No. 44,313; James H. Salter, Reg. No. 35,668; Wiffiam W. Schaal, Reg. No. 39,018; James C. Scheller, 
Reg. No. 31,195; Jeffrey S. Smith, Reg. No. 39,377; Maria McCcnnack Sobtino, Reg. No. 31,639; Stanley 
W. Sokoloff, Reg. No. 25,128; Judith A. Szepesi, Reg. No. 39,393; Vincent P. Tassinari, Reg. No. 42,179; 
Edwin H. Taylor, Reg. No. 25,129; George G. C Tseng, Reg. No. 41,355; Joseph A. TwarowskL Reg. No. 
42,191; Lester J. Vincent, Reg. No. 31,460; John Patrick Ward, Reg. No. 40,216; Charles T. J. Weigell, 
Reg. No. 43,398; Kirk D. Williams, Reg. No. 42^29; James M. Wo, Reg. No. P45,241; Steven D. Yates, 
Reg. No. 42,242; Ben J. Yorks, Reg. No. 33,609; and Norman Zafinan, Reg. No. 26,250; my attorneys; and 
Justin M. Dillon. Reg. No. 42,486, Edwin.AL Sloane, Reg. No. 34.728; and John F. Travis, Reg. No. 
43,203; my patent agents, with offices located at 12400 Wilshtre Boulevard, 7th Floor. Los Angeles, California 
90025, telephone (3 10) 207-3800, and Alan K. Aldous, Reg. No. 31,905; Robert D. Anderson, Reg. No. 33.826; Joseph 
R. Bond, Reg. No. 3«,458; Richard C. Calderwood, Reg No. 35,468; Jeffrey S. Draeger, Reg. No. 41 ,000; Cynthia Thomas 
Faatt, Reg No. 39,973; Sean Fitzgerald. Reg. No. 32,027; SetnZ. Kalson, Reg. No. 40,670; David J. Kaplan, Reg. No. 41.103; 
Charles A. M«ho. Reg. No. 41,199; Leo V. Novakoski, Reg. No. 37,1 98; Naomi OWnata, Reg. No. 39320; Thomas C 
Reynolds, Reg. No. 32,488; Made Sceiey, Reg. No. 31299; Steven P. Skabtat. Reg. No. 36,279; Howaid A. Skaisc, Reg. No. 
36,008; Steven C Stewart, Reg. No. 33355; Raymond J, Werner. Reg. No. 34,752; and Charles K. Young. Reg. No. 39,435; my 
Send correspondence to Vincent P. Tassinari. Reg Nr>. 4? 170 BLAKELY, SOKOLOFF. TAYLOR & 

(N«me of Money or Agent) 

ZAFMAN LLP, 12400 Wilshhe Boulevard, 7& Floor, Los Angeles, California 90025 and direct telephone 

calls to Vincent P Tasrinnri. Reg. No. 42.170. ttlffl 7fT7.<Mm 
(Name of Attorney or Agent) 

I hereby declare that all statements made herein of my own knowledge are true and that all statements male on 
iirformation and belief axe believed to be one; and farther that these statements were made with the knowledge 
that willful false statements and the like so made are punishable by fine or i mprisonment, or both, under Section 
1001 of Tide 18 of the United Stales Code and that such willful false statements may jeopardize the validity of 
die application or any patent issued thereon. 

Fall Name of Sole/first Inventor (riven name, family name) Susan C Kw^aker 

teiiatin^ Dae 



Inventor's Sis 



Residence Chandler, Arizona Citizenship U.S-A. 

(City. Sou) ~ ~ (Country) 

P. O. Address 4909 W. Joshua Blvd. *2 11 2 s 

Chandler, Arizona 85226 Ujg.A. 



DodsetNdO42390J6518 



Name of Second/Joint Inventor ^n^f^ yame) 
Inventor's Signature _ 

Residencc -J&iLbjsL^Aj 



(City, Stat?) 

P.O. Address ^ ? o. F U^. nV . ^ 



Mark L. Brown 



QMzcnship U.S.A. 



(Cottony) 



Full Name of Third/Joint Inventor ^ n ^ ^ name) 



Linda M. Rnho^fc 



Residence 



(City , State) 



Date 9-36- 9? 



Citizenship U.S.A. 



(Country) 



Full Name of Fourth/Joint Inventor ^ ^ farafly ^ 
Inventor's Signature ^jtf^Ci 
Residence Pheonix. Arizona 

(City , Stat*) ' 

P. O. Address 14813 S.24thSm*» 



Pheonix. Arizona 85048 n < a 



Dae 



William C. Arthur. Jr. 



. atizenship U.S.A. 



(Country) 



Full Name of Rfth/Joint Inventor ^ ^ ^ ^ 

Inventor's Stgnature^ 

Residence ' 



P. O. Address 



(City , Stau) 



Citizenship 



(Country) 



D«*eiNo.042390J>6Sl8 



